On Collocations and Their Interaction with Parsing and Translation

نویسنده

  • Violeta Seretan
چکیده

We address the problem of automatically processing collocations—a subclass of multi-word expressions characterized by a high degree of morphosyntactic flexibility—in the context of two major applications, namely, syntactic parsing and machine translation. We show that parsing and collocation identification are processes that are interrelated and that benefit from each other, inasmuch as syntactic information is crucial for acquiring collocations from corpora and, vice versa, collocational information can be used to improve parsing performance. Similarly, we focus on the interrelation between collocations and machine translation, highlighting the use of translation information for multilingual collocation identification, as well as the use of collocational knowledge for improving translation. We give a panorama of the existing relevant work, and we parallel the literature surveys with our own experiments involving a symbolic parser and a rule-based translation system. The results show a significant improvement over approaches in which the corresponding tasks are decoupled.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extracting collocations and their translations from parallel corpora

Identifying collocations in a text (e.g., break record) and correctly translating them (battre record vs. *casser record) represent key issues in machine translation, notably because of their prevalence in language and their syntactic flexibility. This article describes a method for discovering translation equivalents for collocations from parallel corpora, aimed at increasing the lexical cover...

متن کامل

Collocation translation based on sentence alignment and parsing

To date, substantial efforts have been devoted to the extraction of collocations from text corpora. However, only a few works deal with the subsequent processing of results in order for these to be successfully integrated into the NLP applications that could benefit from them (e.g., machine translation). This paper presents an accurate method for identifying translation equivalents of collocati...

متن کامل

Sentence Analysis and Collocation Identification

Identifying collocations in a sentence, in order to ensure their proper processing in subsequent applications, and performing the syntactic analysis of the sentence are interrelated processes. Syntactic information is crucial for detecting collocations, and vice versa, collocational information is useful for parsing. This article describes an original approach in which collocations are identifi...

متن کامل

Investigating the Relationship Between Iranian EFL Learners’ Use of Strategies in Collocating Words and Their Proficiency Level

This study investigated the relationship between Iranian EFL learners’ use of strategies in producing English collocations and their proficiency level. Participants were 115 undergraduate university students at 3 proficiency levels, that is, low, intermediate, and high, majoring in English language at the Faculty of Letters and Humanties at Shahid Chamran University of Ahvaz, Iran. Their select...

متن کامل

The Effects of Collaborative and Individual Output Tasks on Learning English Collocations

  One of the most problematic areas in foreign language learning is collocation. It is often seen as arbitrary and an overwhelming obstacle to the achievement of nativelike fluency. Current second language (L2) instruction research has encouraged the use of collaborative output tasks in L2 classrooms. This study examined the effects of two types of output tasks (editing and cloze) on the learni...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Informatics

دوره 1  شماره 

صفحات  -

تاریخ انتشار 2014